THE HEIGHT OF THE MIXED SPARSE RESULTANT


MARTIN SOMBRA 


Abstract. We present an upper bound for the height of the mixed sparse resultant, 
defined as the logarithm of the maximum modulus of its coefficients. We obtain a 
similar estimate for its Mahler measure. 


Let $\mathcal{A}_0, \dots, \mathcal{A}_n \subset \mathbb{Z}^n$ be finite sets of integer vectors and let $\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n} \in \mathbb{Z}[U_0,\dots,U_n]$
be the associated mixed sparse resultant, or $(\mathcal{A}_0,\dots,\mathcal{A}_n)$-resultant, which is a
polynomial in $n+1$ groups $U_i := \{U_{ia} \,;\, a \in \mathcal{A}_i\}$ of $m_i := \#\mathcal{A}_i$ variables each. We
refer to [Stu94] and [CLO98, Chapter 7] for the definitions and basic facts.

This resultant is widely used as a tool for polynomial equation solving, a fact that
sparked a lot of interest in its computation, see e.g. [CLO98, Sec. 7.6], [EM99], [D'An02],
[JKSS02], while it is also studied from a more theoretical point of view because of
its connections with toric varieties and hypergeometric functions, see e.g. [GKZ94],
[PS93].


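We assume for the sequel that the family of supports $\mathcal{A}_0,\dots,\mathcal{A}_n$ is essential (see
[Stu94, Sec. 1]), which does not represent any loss of generality, by [Stu94, Cor. 1.1].
Set $\mathcal{A} := (\mathcal{A}_0,\dots,\mathcal{A}_n)$, and let $L_\mathcal{A} \subset \mathbb{Z}^n$ denote the $\mathbb{Z}$-module affinely spanned by the
pointwise sum $\sum_{i=0}^n \mathcal{A}_i$. This is a subgroup of $\mathbb{Z}^n$ of finite index

$[\mathbb{Z}^n : L_\mathcal{A}] := \#(\mathbb{Z}^n / L_\mathcal{A})$

because we assumed that the family $\mathcal{A}$ is essential. Also set $Q_i := \mathrm{Conv}(\mathcal{A}_i) \subset \mathbb{R}^n$ for
the convex hull of $\mathcal{A}_i$ for $i = 0,\dots,n$.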
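As an illustration of the index $[\mathbb{Z}^n : L_\mathcal{A}]$, the following minimal sketch treats the planar case $n = 2$ only. It reads "affinely spanned" as the lattice generated by the differences $a - a'$ with $a, a'$ in the same support, and uses the standard fact that the index of a full-rank sublattice of $\mathbb{Z}^2$ is the gcd of the $2 \times 2$ minors of a generating set. The function name and the two families of supports are hypothetical, chosen only for illustration.

```python
from itertools import combinations
from math import gcd

def lattice_index_2d(supports):
    """Return [Z^2 : L_A] for planar supports, or 0 if L_A has rank < 2."""
    # L_A is generated by the differences a - a0 taken inside each support.
    generators = []
    for support in supports:
        x0, y0 = support[0]
        generators += [(x - x0, y - y0) for (x, y) in support[1:]]
    # For a full-rank sublattice of Z^2 the index is the gcd of all 2x2 minors.
    index = 0
    for u, v in combinations(generators, 2):
        index = gcd(index, u[0] * v[1] - u[1] * v[0])
    return index

# Hypothetical family whose differences only span 2Z x 2Z.
coarse = [[(0, 0), (2, 0)], [(0, 0), (0, 2)], [(0, 0), (2, 2)]]
# Hypothetical family containing both unit vectors, so that L_A = Z^2.
fine = [[(0, 0), (1, 0)], [(0, 0), (0, 1)], [(0, 0), (1, 1)]]
print(lattice_index_2d(coarse), lattice_index_2d(fine))
```

The sketch does not check whether a family is essential; it only computes the index of the lattice.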

We denote by MV the mixed volume function as defined in e.g. [CLO98, Sec. 7.4]: this
is normalized so that for a polytope $P \subset \mathbb{R}^n$, the mixed volume $\mathrm{MV}(P,\dots,P)$ equals
$n!$ times the Euclidean volume $\mathrm{Vol}_{\mathbb{R}^n}(P)$. We also set $\mathrm{Vol}(P) := \mathrm{MV}(P,\dots,P) =
n!\,\mathrm{Vol}_{\mathbb{R}^n}(P)$.
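To make the normalization concrete, here is a minimal sketch for lattice polygons ($n = 2$). It relies on the classical identity $\mathrm{MV}(P,Q) = \mathrm{Area}(P+Q) - \mathrm{Area}(P) - \mathrm{Area}(Q)$, which is compatible with $\mathrm{MV}(P,P) = 2\,\mathrm{Area}(P)$. The function names and the two test polygons are hypothetical and are not taken from the references.

```python
def convex_hull(points):
    """Andrew's monotone chain; returns the hull vertices counter-clockwise."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    def half(seq):
        chain = []
        for p in seq:
            while len(chain) >= 2 and cross(chain[-2], chain[-1], p) <= 0:
                chain.pop()
            chain.append(p)
        return chain[:-1]

    return half(pts) + half(reversed(pts))

def twice_area(points):
    """Twice the Euclidean area of Conv(points), via the shoelace formula."""
    hull = convex_hull(points)
    total = 0
    for i in range(len(hull)):
        x1, y1 = hull[i]
        x2, y2 = hull[(i + 1) % len(hull)]
        total += x1 * y2 - x2 * y1
    return abs(total)

def mixed_volume_2d(P, Q):
    """MV(P, Q) for lattice polygons, normalized so that MV(P, P) = 2 Area(P)."""
    minkowski_sum = [(p[0] + q[0], p[1] + q[1]) for p in P for q in Q]
    return (twice_area(minkowski_sum) - twice_area(P) - twice_area(Q)) // 2

square = [(0, 0), (1, 0), (1, 1), (0, 1)]      # unit square, area 1
triangle = [(0, 0), (1, 0), (0, 1)]            # standard triangle, area 1/2
print(mixed_volume_2d(square, square))         # 2 = 2! * 1
print(mixed_volume_2d(triangle, triangle))     # 1 = 2! * 1/2
print(mixed_volume_2d(square, triangle))       # 2
```

Working with twice the area keeps all intermediate quantities integral for lattice polygons.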

Under this notation and assumption, the resultant is a multihomogeneous polynomial
of degree

$\deg_{U_i}(\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}) = \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]}\, \mathrm{MV}(Q_0,\dots,Q_{i-1},Q_{i+1},\dots,Q_n) > 0$

with respect to each group of variables $U_i$, see [PS93, Cor. 2.4].


The absolute height of a polynomial $g = \sum_a c_a x^a \in \mathbb{C}[x_1,\dots,x_n]$ is defined as $H(g) :=
\max\{|c_a| \,;\, a \in \mathbb{N}^n\}$. Here we will be mainly concerned with its (logarithmic) height:

$h(g) := \log H(g) = \log\max\{|c_a| \,;\, a \in \mathbb{N}^n\}$.
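The two definitions above translate directly into code. The following minimal sketch stores a polynomial as a dictionary from exponent vectors to coefficients; this representation and the sample polynomial are assumptions made for illustration only.

```python
from math import log

def absolute_height(poly):
    """H(g): the maximum modulus of the coefficients of g."""
    return max(abs(c) for c in poly.values())

def height(poly):
    """h(g) = log H(g): the logarithmic height of g."""
    return log(absolute_height(poly))

# Hypothetical example: g = 3 x^2 y - 7 x y + 2.
g = {(2, 1): 3, (1, 1): -7, (0, 0): 2}
print(absolute_height(g), height(g))
```

Since `abs` also accepts complex numbers, the same functions apply to polynomials with complex coefficients.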


Date: November 28, 2002.

2000 Mathematics Subject Classification. Primary 11G50; Secondary 13P99. 

Key words and phrases. Mixed sparse resultant, height of polynomials, Mahler measure. 
Supported by a Marie Curie Post-doctoral fellowship of the European Community Program
Improving Human Research Potential and the Socio-economic Knowledge Base, contract n° HPMFCT-2000-00709.




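The main result of this paper is the following upper bound for the height of the
resultant:

Theorem 1.1.

$h(\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}) \le \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}(Q_0,\dots,Q_{i-1},Q_{i+1},\dots,Q_n)\, \log(\#\mathcal{A}_i)$.

We write for short $\mathrm{Res}_\mathcal{A} := \mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}$ and $\mathrm{MV}_i(\mathcal{A}) := \mathrm{MV}(Q_0,\dots,Q_{i-1},Q_{i+1},\dots,Q_n)$
for $i = 0,\dots,n$. Thus the previous result can be rephrased as

$h(\mathrm{Res}_\mathcal{A}) \le \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}_i(\mathcal{A})\, \log(m_i)$.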

This improves our previous bound for the unmixed case [Som02, Cor. 2.5] and extends
it to the general case. We remark that the obtained upper bound is polynomial in the
size of the input family of supports $\mathcal{A}$ and in the mixed volumes $\mathrm{MV}_i(\mathcal{A})$, and hence it
represents a substantial improvement over all previous general estimates. These
are the ones which follow either from the Canny-Emiris type formulas (see Inequality
(4) in the appendix, see also [KPS01, Prop. 1.7], [Roj00, Thm. 23]) or from direct
application of the unmixed case (see the inequality (3) below for $k = 1$).

We also consider the Mahler measure, which is another usual notion for the size of an
$n$-variate polynomial. The Mahler measure of $g \in \mathbb{C}[x_1,\dots,x_n]$ is defined as

$m(g) := \int_{S_1^n} \log|g| \, d\mu^n$,

where $S_1 \subset \mathbb{C}$ is the unit circle, and $d\mu$ is the Haar measure over $S_1$ of total mass 1.
This can be compared with the height: in our case

(1)  $-\frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]}\sum_{i=0}^n \mathrm{MV}_i(\mathcal{A}) \log(m_i) \;\le\; m(\mathrm{Res}_\mathcal{A}) - h(\mathrm{Res}_\mathcal{A}) \;\le\; \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]}\sum_{i=0}^n \mathrm{MV}_i(\mathcal{A}) \log(m_i)$

by [KPS01, Lem. 1.1]. We refer to [KPS01, Sec. 1.1.1] for an account of some of the
notions of height of complex polynomials: just note that the height $h(g)$ here coincides
with $\log|g|_\infty$ in that reference.
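The definition of the Mahler measure can be explored numerically in one variable. The sketch below approximates the integral over the unit circle by an average over equally spaced sample points; the function name, the sample count and the test polynomial $x - 2$ are assumptions for illustration. For this polynomial Jensen's formula gives $m(x-2) = \log 2$, which here also equals its height.

```python
import cmath
from math import log, pi

def mahler_measure_1d(coeffs, samples=4096):
    """Approximate m(g) for g(x) = sum_k coeffs[k] x^k by averaging log|g|
    over equally spaced points of the unit circle."""
    total = 0.0
    for j in range(samples):
        # Half-step offset: the sample points avoid the point z = 1.
        z = cmath.exp(1j * 2 * pi * (j + 0.5) / samples)
        value = sum(c * z**k for k, c in enumerate(coeffs))
        total += log(abs(value))
    return total / samples

approx = mahler_measure_1d([-2, 1])   # g(x) = x - 2
print(approx, log(2))
```

This plain average is only a sketch: it converges slowly for polynomials with zeros on or near the unit circle.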

We obtain the same estimate as before for the Mahler measure of the resultant. 


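Theorem 1.2.

$m(\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}) \le \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}(Q_0,\dots,Q_{i-1},Q_{i+1},\dots,Q_n)\, \log(\#\mathcal{A}_i)$.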


Note that this improves by a factor of 2 the estimate which would derive from direct
application of Theorem 1.1 and the inequalities (1) above.

Both estimates are a consequence of the following: 

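Lemma 1.3. Let $f_0 \in \mathbb{C}^{m_0},\dots,f_n \in \mathbb{C}^{m_n}$. Then

$\log|\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}(f_0,\dots,f_n)| \le \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}(Q_0,\dots,Q_{i-1},Q_{i+1},\dots,Q_n)\, \log\|f_i\|_1$,

where $\|f_i\|_1 := \sum_{a \in \mathcal{A}_i} |f_{ia}|$ denotes the $\ell^1$-norm of the vector $f_i = (f_{ia} \,;\, a \in \mathcal{A}_i)$.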
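Let $g = \sum_a c_a x^a \in \mathbb{C}[x_1,\dots,x_n]$. Then for $a \in \mathbb{N}^n$ we have that

$c_a = \frac{1}{(2\pi i)^n} \int_{S_1^n} \frac{g(z_1,\dots,z_n)}{z_1^{a_1+1} \cdots z_n^{a_n+1}} \, dz_1 \cdots dz_n$

by Cauchy's formula and so $h(g) \le \sup\{\log|g(\xi)| \,;\, \xi \in S_1^n\}$. Thus Theorem 1.1 is a
consequence of this inequality applied to $g := \mathrm{Res}_\mathcal{A}$, together with Lemma 1.3: when
all the coefficients $f_{ia}$ lie on the unit circle, we have $\|f_i\|_1 = m_i$.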
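The bound of the height by the supremum on the torus has a discrete analogue that is easy to test. In the sketch below, a univariate polynomial of degree at most $N - 1$ is recovered from its values at the $N$-th roots of unity, so that each coefficient is an average of values of modulus at most the maximum over the samples. The function name and the test polynomial are hypothetical.

```python
import cmath
from math import pi

def coefficients_from_samples(g, degree):
    """Recover c_0, ..., c_degree of a univariate polynomial g from its values
    at the N-th roots of unity, N = degree + 1. Also return the largest
    modulus among the sampled values."""
    N = degree + 1
    roots = [cmath.exp(1j * 2 * pi * j / N) for j in range(N)]
    values = [g(z) for z in roots]
    # Discrete analogue of Cauchy's formula: c_a = (1/N) sum_j g(z_j) z_j^(-a).
    coeffs = [sum(v * z**(-a) for v, z in zip(values, roots)) / N
              for a in range(N)]
    return coeffs, max(abs(v) for v in values)

# Hypothetical example: g(x) = 1 - 3x + 2x^2, with H(g) = 3.
coeffs, sup_on_samples = coefficients_from_samples(
    lambda z: 1 - 3 * z + 2 * z * z, 2)
print([round(c.real, 6) for c in coeffs], sup_on_samples)
```

Each recovered coefficient is bounded in modulus by `sup_on_samples`, which is itself at most the supremum of $|g|$ over the unit circle.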

On the other hand, Theorem 1.2 follows from Lemma 1.3 by a straightforward
estimation of the integral in the definition of the Mahler measure.


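Proof of Lemma 1.3. Let $k \in \mathbb{N}$. Then let $k\mathcal{A}_i \subset \mathbb{Z}^n$ denote the pointwise sum of
$k$ copies of $\mathcal{A}_i$, and set $k\mathcal{A} := (k\mathcal{A}_0,\dots,k\mathcal{A}_n)$. It is easy to verify that $k\mathcal{A}$ is also
essential, and $\mathrm{Conv}(k\mathcal{A}_i) = kQ_i$.

We identify each $f_i \in \mathbb{C}^{m_i}$ with the corresponding Laurent polynomial $f_i = \sum_{a \in \mathcal{A}_i} f_{ia} x^a$
and we set $f_i^k \in \mathbb{C}^{\#k\mathcal{A}_i}$ for the vector which corresponds to the $k$-th power of $f_i$. By
the factorization formula for resultants [PS93, Prop. 7.1] we get that

$\mathrm{Res}_{k\mathcal{A}}(f_0^k,\dots,f_n^k) = \mathrm{Res}_\mathcal{A}(f_0,\dots,f_n)^{k^{n+1}}$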

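and so

$k^{n+1} \log|\mathrm{Res}_\mathcal{A}(f_0,\dots,f_n)| \le h(\mathrm{Res}_{k\mathcal{A}}) + \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}_i(k\mathcal{A}) \log\|f_i^k\|_1$

(2)  $\le h(\mathrm{Res}_{k\mathcal{A}}) + \frac{k^{n+1}}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}_i(\mathcal{A}) \log\|f_i\|_1$.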

The first inequality follows from the straightforward estimate $|G(u_0,\dots,u_n)| \le H(G)
\prod_{i=0}^n \|u_i\|_1^{d_i}$ for a multihomogeneous polynomial $G$ of degree $d_i$ in each group of
variables, applied to $G := \mathrm{Res}_{k\mathcal{A}}$ and $u_i := f_i^k$. The second one follows from the linearity
of the mixed volume, and the sub-multiplicativity of the $\ell^1$-norm with respect to polynomial
multiplication (which implies that $\log\|f_i^k\|_1 \le k \log\|f_i\|_1$).
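The behavior of the $\ell^1$-norm under polynomial multiplication used in the second inequality can be checked on small examples. The following minimal sketch stores polynomials as dictionaries from exponent vectors to coefficients; the helper names and the sample polynomials are assumptions made for illustration.

```python
def poly_mul(f, g):
    """Product of two polynomials stored as {exponent tuple: coefficient}."""
    product = {}
    for a, ca in f.items():
        for b, cb in g.items():
            e = tuple(x + y for x, y in zip(a, b))
            product[e] = product.get(e, 0) + ca * cb
    return product

def l1_norm(f):
    """Sum of the moduli of the coefficients of f."""
    return sum(abs(c) for c in f.values())

def poly_pow(f, k):
    """k-th power of f by repeated multiplication."""
    nvars = len(next(iter(f)))
    result = {(0,) * nvars: 1}
    for _ in range(k):
        result = poly_mul(result, f)
    return result

# Hypothetical example with cancellation: f = x - y + 2, so ||f||_1 = 4.
f = {(1, 0): 1, (0, 1): -1, (0, 0): 2}
# Hypothetical example without cancellation: p = 1 + x, so ||p||_1 = 2.
p = {(0,): 1, (1,): 1}
print(l1_norm(poly_pow(f, 3)), l1_norm(f) ** 3)
print(l1_norm(poly_pow(p, 3)), l1_norm(p) ** 3)
```

For polynomials with nonnegative coefficients the inequality $\|f^k\|_1 \le \|f\|_1^k$ is an equality, as the second example shows.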


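Now let $\mathcal{B} \subset \mathbb{Z}^n$ be any finite set such that $L_\mathcal{B} = \mathbb{Z}^n$ and such that $\mathcal{A}_0,\dots,\mathcal{A}_n \subset \mathcal{B}$.
Set $n(k) := \#k\mathcal{B}$ and $P := \mathrm{Conv}(\mathcal{B}) \subset \mathbb{R}^n$. Then the (unmixed) resultant $\mathrm{Res}_{k\mathcal{B}}$
is a polynomial in $(n+1)\,n(k)$ variables and total degree $(n+1)\,\mathrm{Vol}(kP) = (n+1)\,k^n\,\mathrm{Vol}(P)$.
We have also that $L_{k\mathcal{B}} = \mathbb{Z}^n$ and so we are in the hypothesis of [Som02,
Cor. 2.5], which gives the height estimate

$h(\mathrm{Res}_{k\mathcal{B}}) \le 2\,(n+1) \log(n(k))\, \mathrm{Vol}(kP) = 2\,(n+1) \log(n(k))\, k^n\, \mathrm{Vol}(P)$.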

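We have that $k\mathcal{A}_i \subset k\mathcal{B}$ for $i = 0,\dots,n$ and so by [Stu94, Cor. 4.2] there exists a
monomial order $\prec$ such that $\mathrm{Res}_{k\mathcal{A}}$ divides the initial form $\mathrm{init}_\prec(\mathrm{Res}_{k\mathcal{B}})$. This is a
polynomial in $(n+1)\,n(k)$ variables of degree and height bounded by those of $\mathrm{Res}_{k\mathcal{B}}$,
and so

$h(\mathrm{Res}_{k\mathcal{A}}) \le h(\mathrm{Res}_{k\mathcal{B}}) + 2 \log\big((n+1)\,n(k) + 1\big)\,(n+1)\,k^n\,\mathrm{Vol}(P)$

(3)  $\le 4\,(n+1) \log\big((n+1)\,n(k) + 1\big)\, k^n\, \mathrm{Vol}(P)$

by the inequality $h(f) \le h(g) + 2 \deg(g) \log(n+1)$, which holds for $f, g \in \mathbb{Z}[x_1,\dots,x_n]$
such that $f \mid g$ (see [KPS01, Lem. 1.2(1.d)]) applied to $f := \mathrm{Res}_{k\mathcal{A}}$ and $g :=
\mathrm{init}_\prec(\mathrm{Res}_{k\mathcal{B}})$.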


Finally we set $\mathcal{B} := (b + d\,[0,1]^n) \cap \mathbb{Z}^n$, where $[0,1]$ denotes the unit interval of $\mathbb{R}$,
for some $b \in \mathbb{Z}^n$ and $d \in \mathbb{N}$ such that $\mathcal{A}_0,\dots,\mathcal{A}_n \subset b + d\,[0,1]^n$. Then $\log n(k) =
\log\big(\#((kb + kd\,[0,1]^n) \cap \mathbb{Z}^n)\big) = \log (kd+1)^n = O_k(\log k)$ (here the notation $O_k$ refers
to the dependence on $k$) and so

$h(\mathrm{Res}_{k\mathcal{A}}) = O_k(k^n \log k)$.

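Alternatively, we could have obtained this from the inequality (4) in the appendix.
Together with the inequality (2) this implies that

$\log|\mathrm{Res}_\mathcal{A}(f_0,\dots,f_n)| \le \frac{1}{[\mathbb{Z}^n : L_\mathcal{A}]} \sum_{i=0}^n \mathrm{MV}_i(\mathcal{A}) \log\|f_i\|_1 + O_k\!\left(\frac{\log k}{k}\right)$,

from where we conclude by letting $k \to \infty$. □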


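Let us consider some examples. For short we set $H(\mathcal{A}) := H(\mathrm{Res}_\mathcal{A})$ and $E(\mathcal{A}) :=
\prod_{i=0}^n m_i^{\mathrm{MV}_i(\mathcal{A})/[\mathbb{Z}^n : L_\mathcal{A}]}$, and we write

$q(\mathcal{A}) := \frac{\log E(\mathcal{A})}{\log H(\mathcal{A})}$

for the quotient between the estimate from Theorem 1.1 and the height of the resultant.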


Example 1.1. Sylvester resultants. For $d \in \mathbb{N}$ we let

$\mathcal{A}_0(d) = \mathcal{A}_1(d) := \{0, 1, 2, \dots, d\} \subset \mathbb{Z}$.

The corresponding resultant coincides with the Sylvester resultant of two univariate
polynomials of the same degree $d$. In this case $\mathrm{MV}_0(d) = \mathrm{MV}_1(d) = d$ and $\#\mathcal{A}_0(d) =
\#\mathcal{A}_1(d) = d + 1$, and so $E(d) := E(\mathcal{A}_0(d), \mathcal{A}_1(d)) = (d+1)^{2d}$.

We compute the height $H(d) := H(\mathcal{A}_0(d), \mathcal{A}_1(d))$ for $2 \le d \le 7$ with the aid of Maple,
and we collect the results in the following comparative table:

  d      2      3       4         5            6                7
  H(d)   2      3       10        23           78               274
  E(d)   81     4,096   390,625   60,466,176   13,841,287,201   4,398,046,511,104
  q(d)   6.33   7.57    5.59      5.71         5.35             5.18
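For small $d$ the quantities in the table can be recomputed without a computer algebra system. The sketch below expands the determinant of the $2d \times 2d$ Sylvester matrix by the Leibniz formula, which is a substitute for the Maple computation and is only practical for small $d$ since it enumerates $(2d)!$ permutations. All function names are hypothetical.

```python
from itertools import permutations
from math import log

def sylvester_matrix(d):
    """2d x 2d Sylvester matrix of two generic polynomials of degree d.
    Entries are None (zero) or a variable index: 0..d stand for a_0..a_d
    and d+1..2d+1 stand for b_0..b_d."""
    size = 2 * d
    M = [[None] * size for _ in range(size)]
    for r in range(d):
        for k in range(d + 1):
            M[r][r + k] = k                # shifted coefficients of f
            M[d + r][r + k] = d + 1 + k    # shifted coefficients of g
    return M

def permutation_sign(perm):
    """Sign of a permutation: (-1)^(number of cycles of even length)."""
    sign, seen = 1, [False] * len(perm)
    for i in range(len(perm)):
        length, j = 0, i
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        if length and length % 2 == 0:
            sign = -sign
    return sign

def sylvester_height(d):
    """Absolute height H(d) of the Sylvester resultant (Leibniz expansion)."""
    M, size, coeffs = sylvester_matrix(d), 2 * d, {}
    for perm in permutations(range(size)):
        entries = [M[r][perm[r]] for r in range(size)]
        if None in entries:
            continue  # this term of the expansion vanishes
        monomial = tuple(sorted(entries))
        coeffs[monomial] = coeffs.get(monomial, 0) + permutation_sign(perm)
    return max(abs(c) for c in coeffs.values())

def estimate(d):
    """E(d) = (d + 1)^(2d), the bound of Theorem 1.1 in exponential form."""
    return (d + 1) ** (2 * d)

H2, E2 = sylvester_height(2), estimate(2)
print(H2, E2, log(E2) / log(H2))
```

Calling `sylvester_height(3)` and `sylvester_height(4)` allows a direct comparison with the corresponding columns of the table; larger values of $d$ would need a smarter expansion.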


Example 1.2. We take this example from [EM99, Exmpl. 3.5]. Let

$\mathcal{A}_0 := \{(0,0), (1,1), (2,1), (1,0)\}$,

$\mathcal{A}_1 := \{(0,1), (2,2), (2,1), (1,0)\}$,

$\mathcal{A}_2 := \{(0,0), (0,1), (1,1), (1,0)\}$.

Then $\mathrm{MV}_0 = 4$, $\mathrm{MV}_1 = 3$ and $\mathrm{MV}_2 = 4$, so that $E(\mathcal{A}) = 4^4 \cdot 4^3 \cdot 4^4$. On the other
hand, we can compute the resultant using its expression in [EM99, Exmpl. 3.19] as a
quotient of determinants, and we obtain that $H(\mathcal{A}) = 8$. Hence

$H(\mathcal{A}) = 8$ ,  $E(\mathcal{A}) = 4{,}194{,}304$ ,  $q(\mathcal{A}) = 7.33$ .

For reference, the straightforward estimation of the Canny-Emiris formula gives (see
the appendix below):

$H(\mathcal{A}) \le 4^{41} = 4{,}835{,}703{,}278{,}458{,}516{,}698{,}824{,}704$ .
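The mixed volumes and the estimates of Example 1.2 can be rechecked with a short computation. The sketch below uses the planar formula $\mathrm{MV}(P,Q) = \sum_e h_Q(N_e)$, where $e$ runs over the edges of $P$, $N_e$ is the outer normal of $e$ with the same length as $e$, and $h_Q$ is the support function of $Q$. The counter-clockwise vertex orderings are entered by hand, and the function names are hypothetical.

```python
from math import log

def support(Q, u):
    """Support function h_Q(u) = max over q in Q of <u, q>."""
    return max(u[0] * q[0] + u[1] * q[1] for q in Q)

def mixed_volume_2d(P_ccw, Q):
    """MV(P, Q) for a convex polygon P given by its vertices in
    counter-clockwise order and a finite point set Q."""
    total = 0
    for i in range(len(P_ccw)):
        x1, y1 = P_ccw[i]
        x2, y2 = P_ccw[(i + 1) % len(P_ccw)]
        outer_normal = (y2 - y1, x1 - x2)  # edge vector rotated by -90 degrees
        total += support(Q, outer_normal)
    return total

# Supports of Example 1.2, each listed counter-clockwise (ordering by hand).
Q0 = [(0, 0), (1, 0), (2, 1), (1, 1)]
Q1 = [(1, 0), (2, 1), (2, 2), (0, 1)]
Q2 = [(0, 0), (1, 0), (1, 1), (0, 1)]

mvs = [mixed_volume_2d(Q1, Q2), mixed_volume_2d(Q0, Q2), mixed_volume_2d(Q0, Q1)]
sizes = [len(Q0), len(Q1), len(Q2)]

E = 1
for m, mv in zip(sizes, mvs):
    E *= m ** mv                      # estimate of Theorem 1.1, exponential form
q = log(E) / log(8)                   # H(A) = 8 is taken from the text
canny_emiris = 4 ** (2 * 15 + 11)     # estimate from the appendix

print(mvs, E, round(q, 2), canny_emiris)
```

The value $H(\mathcal{A}) = 8$ is not recomputed here; only the mixed volumes and the two estimates are.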
Example 1.3. We take this example from [Stu94, Exmpl. 2.1]. Let

$\mathcal{A}_0 := \{(0,0), (2,2), (1,3)\}$,

$\mathcal{A}_1 := \{(0,1), (2,0), (1,2)\}$,

$\mathcal{A}_2 := \{(3,0), (1,1)\}$.

Then $\mathrm{MV}_0 = 5$, $\mathrm{MV}_1 = 7$ and $\mathrm{MV}_2 = 7$, so that $E(\mathcal{A}) = 3^5 \cdot 3^7 \cdot 2^7$. From the explicit
monomial expansion of the resultant (see [Stu94, Exmpl. 2.1]) we find that $H(\mathcal{A}) = 14$
and so

$H(\mathcal{A}) = 14$ ,  $E(\mathcal{A}) = 68{,}024{,}448$ ,  $q(\mathcal{A}) = 6.83$ .


These examples show that there is still some room for improvement over Theorem 1.1.
It is however possible that our estimate is quite sharp anyway: in spite of the large
difference between $H(\mathcal{A})$ and $E(\mathcal{A})$ in the computed examples, the quotient $q(\mathcal{A})$ is
quite small, and moreover it does not seem to grow when $E(\mathcal{A}) \to \infty$.

In any case, it would be very interesting to have an exact expression for $h(\mathrm{Res}_\mathcal{A})$,
as remarked to me by B. Sturmfels, or at least a non trivial lower bound. Note that
the only information that we have about the exact value of the coefficients of $\mathrm{Res}_\mathcal{A}$
is for the extremal ones, which are equal to $\pm 1$ [Stu94, Cor. 3.1].


Remark 1.4. While a first version of this paper was circulating, C. D'Andrea (personal
communication) obtained a non trivial lower bound for the height of the Sylvester
resultant, and an improvement of the upper bound to $H(\mathrm{Res}(f,g)) \le \max\{\deg(f)!, \deg(g)!\}$.




A final remark with respect to the Mahler measure. Let $X_\mathcal{A} \subset \mathbb{P}^{m_0-1} \times \cdots \times \mathbb{P}^{m_n-1}$
be the projective toric variety associated to $\mathcal{A}$ and let $\mathcal{L}_i$ denote the equivariant line
bundle which corresponds to $\mathcal{A}_i$ for $i = 0,\dots,n$ as defined in [GKZ94, Section 8.1].
Then the $\mathcal{A}$-resultant coincides with the $(\mathcal{L}_0,\dots,\mathcal{L}_n)$-resultant of $X_\mathcal{A}$, see [GKZ94,
Ch. 8, Prop. 1.5].

In the context of Arakelov geometry, it is possible to equip these line bundles with a
canonical Hermitian metric following [Mai00, Section 3.3]. It is then natural to
conjecture that

$m(\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}) = h_{\overline{\mathcal{L}}_0,\dots,\overline{\mathcal{L}}_n}(X_\mathcal{A})$,

where $h_{\overline{\mathcal{L}}_0,\dots,\overline{\mathcal{L}}_n}(X_\mathcal{A})$ denotes the multiheight of $X_\mathcal{A}$ with respect to the Hermitian
line bundles $\overline{\mathcal{L}}_0,\dots,\overline{\mathcal{L}}_n$. This is suggested by an analogous result for the height of a
projective variety with respect to the Fubini-Study metric, see e.g. [Sou92, Thm. 3]
or [Phi91, Cor. 2.4].

If this is the case, together with [Mai00, Prop. 7.11] this would improve Theorem 1.2
above to $m(\mathrm{Res}_{\mathcal{A}_0,\dots,\mathcal{A}_n}) = 0$.


Appendix: Estimation of the height via the Canny-Emiris formula 

For ease of reference, we establish here the estimate for $h(\mathrm{Res}_\mathcal{A})$ which
follows from the Canny-Emiris formula and the standard estimates for the behavior
of the height of polynomials under addition, multiplication and division.

Assume that $L_\mathcal{A} = \mathbb{Z}^n$ and set $Q := \sum_{i=0}^n Q_i$. Given a coherent mixed subdivision
of $Q$ and a set

$\mathcal{E} := (Q + \delta) \cap \mathbb{Z}^n$,

where $\delta \in \mathbb{Q}^n$ is sufficiently small and generic, there is a family of Canny-Emiris
(square non singular) matrices $\mathcal{M}_0,\dots,\mathcal{M}_n$.
For $j = 0,\dots,n$, the given subdivision of $Q$ splits the set $\mathcal{E}$ into a disjoint union
$\mathcal{E} = \mathcal{E}_0(j) \sqcup \cdots \sqcup \mathcal{E}_n(j)$. The elements in $\mathcal{E}$ are in bijection with the rows of $\mathcal{M}_j$, and
to each $p \in \mathcal{E}_i(j)$ corresponds a row of $\mathcal{M}_j$ with exactly $m_i$ non zero entries, which
consist of the variables in $U_i := \{U_{ia} \,;\, a \in \mathcal{A}_i\}$. We refer to [CLO98, Sec. 7.6] for the
precise construction.

Set $D_j := \det(\mathcal{M}_j) \in \mathbb{Z}[U_0,\dots,U_n] \setminus \{0\}$. The Canny-Emiris formula [CLO98, Ch.
7, Thm. 6.12] states that $\mathrm{Res}_\mathcal{A} = \gcd(D_0,\dots,D_n)$.

Then $D_0$ is a multihomogeneous polynomial of degree $N_i := \#\mathcal{E}_i(0)$ in each group of
variables $U_i$ and of height bounded by $\sum_{i=0}^n N_i \log(m_i)$. We have that $\mathrm{Res}_\mathcal{A} \mid D_0$ and
so $m(\mathrm{Res}_\mathcal{A}) \le m(D_0)$, which combined with [KPS01, Lem. 1.1] gives

(4)  $h(\mathrm{Res}_\mathcal{A}) \le h(D_0) + \sum_{i=0}^n (N_i + \mathrm{MV}_i(\mathcal{A})) \log(m_i) \le \sum_{i=0}^n (2 N_i + \mathrm{MV}_i(\mathcal{A})) \log(m_i)$.

Applied to Example 1.2, this gives the stated estimate: then $N_0 = N_1 = 4$ and $N_2 = 7$
(see [EM99, Exmpl. 3.5]) and so the previous estimate gives $H(\mathcal{A}) \le 4^{2 \cdot 15 + 11} = 4^{41}$.

In general, the estimate so obtained is much worse than that of Theorem 1.1, especially
for $n \gg 0$. Consider e.g. $\mathcal{A}_i := \{0,\dots,d\}^n \subset \mathbb{Z}^n$ for $i = 0,\dots,n$. Then it is easy to
show that Inequality (4) gives

$h(\mathrm{Res}_\mathcal{A}) \le \big(2\,((n+1)\,d)^n + (n+1)\,d^n\big) \log(d+1)$

while Theorem 1.1 gives $h(\mathrm{Res}_\mathcal{A}) \le (n+1)\, d^n \log(d+1)$.